Distributed, Expressive Top-k Subscription Filtering using Covering in Publish/Subscribe Systems
Abstract
Top-k filtering is an effective way of reducing the amount of data sent to subscribers in pub/sub applications. In this paper, we investigate top-k subscription filtering, where a publication is delivered only to the k best-ranked subscribers. The naive approach of filtering early, at the publisher's edge broker, works only if complete knowledge of the subscriptions is available, which is incompatible with the well-established covering optimization in publish/subscribe systems. We propose an efficient rank-cover technique to reconcile top-k subscription filtering with covering. We extend the covering model to support top-k and describe a novel algorithm for forwarding subscriptions to publishers while maintaining correctness. We also establish a framework for supporting different types of ranking semantics, such as fairness and diversity. Finally, we conduct an experimental evaluation and perform sensitivity analysis to demonstrate that our optimized rank-cover algorithm retains both covering and fairness while achieving properties advantageous to our targeted workloads.
Similar resources
Efficient Filtering and Routing in a Scalable XML-Based Publish-Subscribe System
This paper introduces YAK – a scalable content-based publish-subscribe system. YAK employs XML documents and expressive XPath queries as the publication and subscription model. To achieve high scalability, it combines the advantages of content routing in existing publish-subscribe systems and the efficient query indexing technique in the context of XML filtering. The filtering and routing strate...
The PADRES Distributed Publish/Subscribe System
Workflow management systems are traditionally centralized, creating a single point of failure and a scalability bottleneck. In collaboration with Cybermation, Inc., we have developed a content-based publish/subscribe platform, called PADRES, which is a distributed middleware platform with features inspired by the requirements of workflow management and business process execution. These features...
Efficient Distributed Event Processing using Subscription Summaries in Large Scale Publish/Subscribe Systems
A key issue when designing and implementing large-scale publish/subscribe systems is how to efficiently propagate user subscriptions among the brokers of the system. In this paper we contribute the notion of broker subscription summaries and accompanying distributed and scalable algorithms for subscription summary propagation and event filtering and routing. In addition we present a performance...
Top-k/w publish/subscribe: A publish/subscribe model for continuous top-k processing over data streams
Continuous processing of top-k queries over data streams is a promising technique for alleviating the information overload problem as it distinguishes relevant from irrelevant data stream objects with respect to a given scoring function over time. Thus it enables filtering of irrelevant data objects and delivery of top-k objects relevant to user interests in real-time. We propose a solution for...
Indexing for Subscription Covering in Publish-Subscribe Systems
Content based publish-subscribe systems are being increasingly used to deliver information in large distributed environments. Subscription covering is an effective way to reduce the complexity of content-based routing and avoid unnecessary proliferation of subscriptions throughout the system. Although covering detection has been implemented in current systems, their efficient implementation has...
Publication date: 2015